( 



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



TITLE 

UTILIZATION-BASED POWER MANAGEMENT OF A CLOCKED DEVICE 



ASSIGNEE 

Compaq Information Technologies Group, L.P. 

a Delaware corporation 
20555 St. Highway 249, Building CCA 11 
Houston, Texas 77070 



CERTIFICATE OF MAILING 37 C.F.R. 1.10 

"Express Mail" Mailing Label Number EL827179503US 



I hereby certify that this correspondence is being deposited with the United States Postal Service "Express Mail 
Post Office To Addressee" service with sufficient postage on the date indicated below and is addressed to: U.S. 
Patent and Trademark Office, Washington, D.C. 20231. 

Name of Person Signing Certificate: Rochelle M. Pleasant 



(Attorney Docket No. H052617.1298US0) 



INVENTOR 



Lee W. Atkinson 



14418 Ravenhurst 



Houston, Texas 77070 




052617.1097 HOUSTON 243288 vl (P-295-5 CPCG-PPC) 



PATENT APPLICATION 
(Continuation of Patent Application No. 10/224,147) 

for 

UTILIZATION-BASED POWER MANAGEMENT OF A CLOCKED DEVICE 

by 

Lee W. Atkinson 



Assignee: 

Hewlett-Packard Development Company, LP 
20555 S.H. 249 
Houston, TX 77070 



Address for USPTO Correspondence: 

IP Administration 
Legal Department, M/S 35 
Hewlett-Packard Company 

P.O. Box 272400 
Fort Collins, CO 80527-2400 



Attorney Docket No. 200302691-14 



052617.1 298 (P-295-5 CPCG-PPC) 



APPLICATION FOR PATENT 
INVENTORS: LEE WARREN ATKINSON 

TITLE: UTILIZATION-BASED POWER MANAGEMENT OF A 

CLOCKED DEVICE 



SPECIFICATION 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application is a continuation of co-pending U.S. application Serial No. 
09/818,981, filed March 27, 2001, which is a continuation U.S. application Serial No. 
09/480,101, filed January 10, 2000, now U.S. Patent No. 6,233,671, which is a continuation 
U.S. application Serial No. 08/166,609, filed December 13, 1993, now U.S. Patent No. 
6,029,249, which is a continuation of U.S. application Serial No. 08/473,655, filed June 7, 
1995, now U.S. Patent No. 5,625,826, which is a file wrapper continuation of U.S. 
application Serial No. 07/809,301, filed December 17, 1991, now abandoned, which are 
incorporated herein for reference. 

STATEMENTS REGARDING FEDERALLY 
SPONSORED RESEARCH OR DEVELOPMENT 

[0002J Not applicable. 

REFERENCE TO A MICROFICHE APPENDIX 
[0003] Not applicable. 



BACKGROUND OF THE INVENTION 

1 . Field of the Invention 
[0004] The invention relates to battery powered computer systems, and more particularly, 
to circuits and methods for reducing the power consumption of the computer system. 
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2 . Description of the Related Art 
[0005] Portable computer systems are rapidly developing the capabilities of conventional 
desktop or floor mounted personal computer systems. Hard disk units are being integrated 
into portable computers because of the large amounts of information being processed and the 
large size of many application programs. A floppy disk unit is integrated in the vast majority 
of portable computers, even if a hard disk unit is installed, to allow loading of information 
and use of applications requiring key disks, and also to allow use of diagnostic programs. 
Modems have been integrated into portable computers for some time to allow 
communications and information transfer between the user and a remote location, for 
example, the home office. The displays in portable computer systems are becoming much 
more elaborate and readable. The pixel count on the standard liquid crystal displays (LCD's) 
utilized is increasing, as is the viewing angle. The use of backlighting allows use of LCD's in 
low light environments and improves the contrast ratio of the display. More complex 
circuitry is being installed in portable computers to support these improved peripheral devices 
and to support the increased speeds and capabilities of the microprocessors utilized in 
portable computer systems. 

[0006] The various peripheral devices and high speed circuitry mentioned above consume 
large amounts of power when operating. This has resulted in problems in portable computer 
systems because these systems are generally desired to be used in locations where alternating 
current is not available. This has made it very difficult to provide all the possible 
functionality available and yet have an acceptable battery life when the portable computer 
system is battery powered. Using CMOS components helped redyce the power consumption 
of the circuitry, but even the use of CMOS components is insufficient at the clock speeds and 
performance levels of available circuitry. Therefore a dilemma arises whether to provide 
lesser functionality with longer battery life or greater functionality with lesser battery life or 
even no battery operation. 

[0007] Various alternatives were tried to resolve the problem. For example, the 
International Business Machines (IBM) Corporation PC Convertible included a switch which 
the user could press to place the computer system in a standby mode. However, the PC 
convertible was relatively simple, with a low level of functionality as compared to what is 
currently available, and the requirement of a user action to reduce power consumption limited 
its use to circumstance's where the user remembered to depress the switch. Blanking the 
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display after a period of keyboard inactivity saved power as well as prolonged the life of the 
display and was widely utilized. A hard disk unit was developed which reduced the power 
used by the controlling electronics by utilizing only certain portions of the track for servo 
information and turning off the read channel circuitry until just before a servo burst was 
expected. Additionally, a programmable value could be provided to the hard disk unit so that 
after a given inactivity interval defined by this value, the hard disk unit was allowed to spin 
down and all but some interface circuitry was shut down. While these alternatives did 
provide some relief, they were not complete solutions to satisfactorily resolve the dilemma, 
and design tradeoffs still were forced to occur. 

[0008] U.S. Patent No. 4,980,836 to Carter et al. discloses an apparatus for reducing 
power consumption in computer systems. The apparatus monitors the address bus to 
determine when selected peripheral devices have not been accessed for a preset amount of 
time. When the preset amount of time has passed, the system powers itself down and 
disables the system clock, placing the system in a standby mode. The system clock could be 
stopped in this invention because the preferred embodiment of this invention used a static 
CMOS processor and chip set. If there was sufficient energy in the batteries, the system 
could be awakened by the user depressing a standby switch. Computer systems which do not 
use a static CMOS processor or chip set generally reduce the clock frequency when a preset 
amount of time of address bus inactivity has passed. Reducing the clock frequency during 
inactive periods reduces power consumption during this time. However, the frequency of 
peripheral device accesses is not a completely reliable indicator of inactivity of a computer 
system. Thus, in some instances the system clocking signal may be reduced in frequency or 
disabled during a period of high computer system activity. Therefore, a method is needed 
whereby other elements or events of the computer system can be monitored to more reliably 
determine the activity level of the computer system so that the system clock can properly be 
adjusted to reduce power consumption. 

BRIEF SUMMARY OF THE INVENTION 

[0009] A battery powered computer system according to the present invention determines 
when the system is not in use by monitoring various events associated with the operation of 
the computer system. In the preferred embodiment, the system monitors the number of cache 
read misses and write operations, i.e., the cache hit rate, and reduces the system clock 
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frequency when the cache hit rate rises above a certain level. When the cache read hit rate is 
above a certain level, then it can be assumed that the processor is executing a tight loop, such 
as when the processor is waiting for a key to be pressed. In an alternate embodiment of the 
invention, the system monitors other events in addition to, or instead of, the cache read hit 
rate, such as the occurrence of page hits or input/output (I/O) write cycles, to determine the 
level of activity of the computer system. 

[0010] The system according to the preferred embodiment includes a frequency switching 
circuit, an event counter, and a periodic timer. The event counter is preferably used to 
measure the incidence of cache read misses and write operations and may also optionally be 
used to count the number of page misses and memory or I/O writes as desired. The event 
counter includes an overflow or carry line which prevents any further incrementing of the 
counter once the maximum number of counts is reached to prevent the counter from 
overflowing. The periodic timer instructs the CPU via a system interrupt to periodically 
monitor and compare the contents of the event counter. Every event increments the counter 
and, the more events, the more processor activity that is presumed. When the periodic timer 
issues a system interrupt, the CPU reads the contents of the counter and compares the event 
activity with a predetermined value. If the number of events is higher than the predetermined 
value, then the processor switches the operating frequency of the system to a high frequency 
if the system is not already operating at this high frequency. A lower event count causes the 
frequency switching circuit to switch to a lower frequency to conserve power if the system is 
not already operating at this low frequency. 

[0011] The invention allows the battery powered operating period of a computer system 
to be greatly extended without requiring any input from the user and without any noticeable 
loss in processing power. This allows a battery powered computer system to have advanced 
capabilities and functionality while still having a satisfactory battery operating interval. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0012] A better understanding of the invention can be obtained when the following 
detailed description of the preferred embodiment is considered in conjunction with the 
following drawing in which: 
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Figure 1 is a schematic block diagram of a computer system incorporating the present 
invention; 

Figure 2 is a more detailed schematic diagram of a portion of the computer system of 
Figure 1; 

Figure 3 is a schematic diagram of a portion of the computer system of Figure 1 
according to an alternate embodiment of the invention; and 

Figure 4 is a flowchart diagram of a sequence for controlling the operation of the 
computer system of Fig. 1 according to the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[0013] Referring now to Figure 1, a computer system C according to the preferred 
embodiment of the invention is shown. The computer system C is preferably based on the 
386 SL chip set produced by Intel Corporation (Intel). The 386 SL chip set comprises two 
chips, a 386 SL CPU chip (CPU chip) 20 and a 82360 SL chip 22. The CPU chip 20 includes 
an 80386 SX microprocessor, a memory controller, a cache controller, a bus controller, clock 
control circuitry, and power management circuitry. The 82360 SL chip 22 includes a 
programmable interrupt controller (PIC), direct memory access (DMA) controller, a memory 
mapper, various ports, a real time clock (RTC) and power management circuitry. For more 
information on the 386 SL chip set, please see the Intel 386 SL Microprocessor Superset 
System Design Guide 1990 edition; the 386 SL Microprocessor Superset Programmers 
Reference Manual, 1990 edition; and the 386 SL Microprocessor Superset Data Book; all 
published by Intel. 

[0014] An oscillator 24 is connected to the CPU chip 20. The oscillator preferably 
operates at 20 Megahertz (MHz) and provides a 20 MHz clocking signal 25 to the CPU chip 
20. Main memory 26, cache memory 28 and a math coprocessor (MCP) 30 are also 
preferably coupled to the CPU chip 20. The math coprocessor 30 is preferably a 387 SX 
coprocessor produced by Intel. The cache memory is preferably operated as a write-through 
cache memory. A video graphics array (VGA) controller 32 is coupled to the CPU chip 20. 
Serial port buffers 36 and a parallel port 38 are coupled to the 82360 SL chip 22. 

[0015] The CPU chip 20 and the 82360 SL chip 22 are each connected to an I/O bus 40 
based on the industry standard architecture (ISA). However, other bus architectures are also 
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contemplated. The ISA bus 40 is connected through a transceiver 42 to a peripheral interface 
(PI) bus 44. The VGA chip 32 is connected to the ISA bus 40. The ISA bus 40 may include 
a plurality of ISA bus expansion slots 46 if the present invention is used in a desktop 
computer system. The slots 46 are generally omitted in portable computers. The ISA bus 40 
is connected through a transceiver 48 to a hard disk 50. Various logic is coupled to the PI bus 
44, including an external real time clock 54, a floppy disk controller 56, a keyboard controller 
58 and BIOS EPROM 60. The floppy disk controller 56 and keyboard controller 58 control 
operation of a floppy disk unit and keyboard, respectively (both not shown). The BIOS 
EPROM holds the basis input/output (I/O) system software as well as system-specific 
initialization and configuration software. This is an exemplary computer system and other 
designs and architectures could be utilized. 

[0016] The CPU chip 20 includes an active high input referred to as the TURBO input 
(not shown). When the TURBO input receives a logic high signal, the CPU chip 20 enters 
"turbo mode," and the microprocessor executes at a clock speed defined by a bit field in a 
register in the CPU chip 20 referred to as the CPUWRMODE register (not shown). When the 
TURBO input receives a logic low signal, the CPU chip 20 enters "de-turbo mode" and 
executes at a reduced clock speed of Vi or % as defined by a bit in a register. The 
CPUWRMODE register operates in conjunction with the power management circuitry inside 
the CPU chip 20 to control the frequency of the clocking signal 25 provided to the 
microprocessor. In the preferred embodiment, the turbo input receives a logic high value, 
and thus the CPUWRMODE register determines the frequency of the clocking signal 25 
provided to the microprocessor. j 

[0017] Bits 5 and 4 of the CPUWRMODE register determine the speed with which the 
microprocessor clock operates. When bits 5 and 4 of the CPUWRMODE register are each 
set to 0, then the clock speed is the speed of the signal received by the oscillator 24, 
preferably 20 MHz. When bits 5 and 4 of the CPUWRMODE register are set to 0 and 1 
respectively, the clock speed provided to the microprocessor is one half of the frequency of 
the signal received from the oscillator 24, i.e., 10 MHz. When bits 5 and 4 of the 
CPUWRMODE register are 1 and 0 respectively, the clock speed provided to the 
microprocessor is one fourth of the frequency of the signal received from the oscillator 24, 
i.e., 5 MHz. When bits 5 and 4 of the CPUWRMODE register each have a logic 1 value, 
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then the clock speed provided to the CPU is one eighth the frequency of the signal received 
from the oscillator 24, i.e. 2.5 MHz. 

[0018] The computer system C also includes activity monitor logic 70 according to the 
present invention which is coupled to the CPU chip 20. The activity monitor logic 70 
receives signals from the CPU chip 20 as well as signals provided from the CPU chip 20 to 
the main memory 26. The activity monitor logic 70 monitors events associated with the 
microprocessor to determine the activity level of the system. In the preferred embodiment, 
the activity monitor logic 70 monitors the number of cache read misses and write operations 
during preset periods of time, and the CPU chip 20 reduces the system clock frequency when 
the cache hit rate rises above a certain level. When the cache read hit rate is above a certain 
level, then it can be assumed that the processor is executing a tight loop, such as when the 
processor is waiting for a key to be pressed. In this instance, the system clock frequency can 
be reduced to reduce power consumption without affecting system performance. In an 
alternate embodiment of the invention, the activity monitor logic 70 monitors other events 
such as page misses or memory and I/O write operations, among others. It is contemplated 
that the various types of events may be counted either individually or in combination. 

[0019] Referring now to Figure 2, the activity monitor logic 70 according to the preferred 
embodiment of the invention is shown. In the description that follows a signal name 
followed by an asterisk indicates that the signal is asserted when it has a logic low level. The 
activity monitor logic 70 receives column address strobe (CAS*) signals provided from the 
CPU chip 20 to the main memory 26. The CAS* signals are. essentially memory select 
signals that are used in paged memory systems. When a cache read hit occurs, the requested 
data can be provided directly from the cache memory 28 to the microprocessor, and no CAS* 
signals are asserted by the CPU chip 20. Also, since the cache memory is operated as a 
write-through cache, each write operation, whether it be a write hit or write miss, requires a 
memory cycle to the main memory 26. Therefore, the CAS* signals are asserted on all write 
operations in the preferred embodiment. Thus, the CAS* signals are used to determine when 
cache read miss operations and write operations occur, i.e., when main memory cycles occur. 

[0020] In the preferred embodiment, the CPU chip 20 generates eight CAS* signals. The 
CAS* signals are provided to eight inputs of a nine input NAND gate 102, in effect ORing 
these signals together. The output of the NAND gate 102 is provided to an input of a two 
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input AND gate 103. A signal referred to as REFREQ output from the 82360 SL chip 22 is 
preferably provided to the other input of the AND gate 103, which is an inverted input. The 
REFREQ signal indicates, when asserted high, that a memory refresh operation is occurring. 
The inverted REFREQ signal is ANDed with the output of the NAND gate 102 to prevent 
memory refresh operations from being counted as cache read miss or write operations. 

[0021] The output of the AND gate 103 is a signal referred to as CACHE JV1ISS, which is 
provided to the clock input of a 17 bit counter 104 referred to as the CAS* counter. The 
CACHE_MISS signal is therefore asserted when a main memory cycle occurs, which is 
whenever a cache read miss or a write operation occurs. The CAS* counter 104 is 
incremented each time the CACHEMISS signal is asserted. The CAS* counter 104 outputs 
a signal referred to as CARRY which is provided through an inverter 106 to the seventh input 
of the NAND gate 102. The CARRY signal is asserted when the maximum number of counts 
is reached by the counter 104, which is 2 17 . When the CARRY signal is asserted, the 
CACHEJMISS signal is prevented from changing state and thus further counting by the 
counter 104 is disabled. The CAS* counter 104 provides data signals 1-16 to the inputs of a 
16 bit flip-flop 108. Data signal 0 output from the CAS* counter is preferably left 
unconnected. The outputs of the flip-flop 108 are coupled to data bus signals referred to as 
DATAO: 1 5>, which are coupled to the CPU chip 20. 

[0022] Address signals, a write/read (W/R) signal and a memory/input output (M/IO) 
signal are output from the CPU chip 20 and provided to decode logic 110 in the activity 
monitor logic 70. The decode logic 110 outputs a signal referred Jo as RD COUNT which is 
provided to a clock input of the flip-flop 108. The RD_COUNT signal is also provided 
through an inverter 1 1 1 to an inverted clear input of the counter 104 and to an inverted enable 
input of the flip-flop 108. Therefore, when the CPU chip 20 desires to read the counter 104, 
it outputs the appropriate address, the W/R signal and the M/IO signal to the decode logic 
110, which then asserts the RD_COUNT signal to the counter 104 and the flip-flop 108. The 
asserted RD_COUNT signal enables the flip-flop 108 to latch in data from the counter 104 
and transmit the data to the CPU chip 20 and simultaneously operates to clear the contents of 
the counter 104. 
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[0023] In an alternate embodiment, memory page misses and/or I/O write operations may 
also be counted. Memory page miss operations with the preferred microprocessor are 
determined by determining if any of the 4 row address strobe or RAS* signals are asserted 
low during non-refresh cycles. The asserted RAS* signals are indicative of page miss 
operations. As shown in Figure 3, the RAS* signals are provided from the CPU chip 20 to 
the inputs of a four input NAND gate 120. The output of the NAND gate 120 is the page 
miss indication, which is referred to as the PAGEJVfISS signal. The I/O write indication is 
accomplished in the following manner. The W/R signal is connected to an input of a two 
input AND gate 124. The other input of the AND gate 124 is inverted and receives the M/IO 
signal. The output of the AND gate 124 generates a signal referred to as IOW which 
indicates, when asserted high, that an I/O write operation is occurring. 

[0024] The PAGEJMISS and IOW signals are connected to inputs of a three input OR 
gate 126. The eight CAS* signals are connected to the input of an eight input NAND gate 

125 whose output is connected to an input of the OR gate 126. The output of the OR gate 

126 is connected to an input of a three input AND gate 128. A second input of the AND gate 
128 receives the inverted CARRY signal. The third inverted input of the AND gate 128 
receives the REFREQ signal. The output of the AND gate 128 is provided to the clock input 
of the counter 104. Therefore, during memory refresh operations or when the CARRY signal 
is asserted, the output of the AND gate 128 is negated low, and thus the counter 104 does not 
count at these times. 

[0025] In this embodiment, cache read miss and write operations, page miss operations, 
and I/O write operations are all monitored as events. It is noted that any one of these events 
can be monitored either individually or in combination. In yet another alternate embodiment, 
each signal can be provided to separate counters arranged like the counter 104, and the 
outputs of the separate counters can be used to determine system activity. 

[0026] The 82360 SL chip 22 includes an interrupt timer (not shown) which is used to 
generate time of day interrupts to the microprocessor approximately 18.2 times per second. 
The interrupt routine invoked by the timer is also used according to the preferred embodiment 
of the invention to periodically adjust the frequency of the clocking signal 25 received by the 
microprocessor, as is explained below. 



9 



( 

052617.1298 (P-295-5 CPCG-PPC) 



( 



[0027] Referring now to Figure 3, a portion of the software routine that is executed when 
the time of day interrupt routine is invoked is shown. It is understood that the time of day 
interrupt routine may perform other operations than those shown. When the time of day 
interrupt signal is asserted, the CPU chip 20 reads the CAS* counter 104 in step 202. The 
counter 104 is also cleared in step 202. In an alternative embodiment, the separate counters 
providing the CAS* count, the memory page miss count, and the I/O write count are read in 
step 202. In step 204, the microprocessor reads bits 5 and 4 of the CPUWRMODE register to 
determine if the clock signal provided to the CPU chip 20 is operating at a fast or slow 
frequency. In the preferred embodiment, the microprocessor only writes either values 0,0 or 
1,0 to bits 5 and 4, respectively, of the CPUWRMODE register, thus providing the clocking 
signal 25 at either full frequency or one quarter frequency to the microprocessor. The 
CPUWRMODE register is programmed depending on the cache read hit rate as determined 
by the CAS* counter 104 in relation to certain comparison values. Thus, in this embodiment, 
the power management logic toggles between a full clock speed or fast speed, preferably 20 
MHz, and a one quarter frequency clock speed or slow speed, preferably 5 MHz. However, it 
is noted that all four clock speed frequencies may be utilized. 

[0028] If the clocking signal 25 provided to the CPU is determined to be operating at the 
slow speed in step 204, then in step 206 the microprocessor determines if the CAS* count 
value- received from the CAS* counter 104 is greater than a fast comparison value stored 
inside the CPU chip 20. Alternatively, a function based on the three separate count values 
could be evaluated. If the CAS* count value exceeds the fast comparison value in step 206, 
then in step 208 the power management logic in the CPU chip 20 speeds up the clock signal 
25 provided to the microprocessor to the full clock speed, and the routine completes. If the 
CAS* count value is not greater than the fast comparison value in step 206, then the interrupt 
routine completes. 

[0029] If the clocking signal 25 provided to the microprocessor is determined to be 
operating at the fast speed in step 204, then control proceeds to step 212 where the 
microprocessor determines whether the CAS* count value is less than a slow comparison 
value stored in the CPU chip 20. If the CAS* count value is less than the slow comparison 
value in step 212, then the power management logic in the CPU chip 20 slows down the 
clocking signal 25 provided to the microprocessor in step 214 to one quarter speed, and the 
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routine completes. If the CAS* count value is not less than the slow comparison value in step 
212, then the routine completes. 

[0030] The fast and slow comparison values are preferably programmable inside the CPU 
chip 20. In the preferred embodiment, the fast and slow comparison values are the same 
value. In an alternate embodiment, the slow comparison value is proportionally less that the 
fast value, i.e., one fourth less to reflect a true hit/miss ratio. In addition, since the comparison 
values can be user-adjusted inside the CPU chip 20, the values could be derived from 
characterization of common user applications. 

[0031] As described above, the preferred embodiment utilizes the CPU chip 20 in "turbo 
mode" and uses software to perform the activity comparisons and speed changes. In an 
alternative embodiment, additional registers can be used to contain the comparison values 
and an additional timer can be used to define the comparison interval. Hardware comparators 
are configured to perform the comparison logic described above at the appropriate time and 
change the state of the TURBO input based on the comparison to speed up or slow down the 
CPU chip 20 as appropriate. 

[0032] Thus, the computer system according to the present invention monitors various 
power consumption related events, indicates certain changes to the user and enters an 
inactivity state upon an appropriate period of time after monitored system devices have been 
used. 

[0033] The foregoing disclosure and description of the invention are illustrative and 
explanatory thereof, and various changes in the components, circuit elements, or flowcharts 
may be made without departing from the spirit of the invention. 
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